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Motivation 


Chip  multiprocessors  are  the  way  to  deal 
with  increasing  computational  load  in 
embedded  real-time  systems 

•  Power  consumption,  heat  dissipation,  and  other 
physical  constraints  render  single  processors 
impractical 


Power  consumption  is  a  concern  in  battery- 
powered  real-time  systems 

•  battery  life  time 

•  battery  weight 
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Overview 


Voltage  and  frequency  scaling  (VFS)  allows  reducing  the  power  consumption  of 
a  processor,  and  its  speed. 

VFS  in  real-time  systems  must  ensure  that  the  system  remains  schedulable. 

Existing  VFS  algorithms  for  multiprocessors  leave  unused  idle  capacity. 

•  processor  constraints  Freci  A 

•  algorithm  constraints 


Growing  Minimum  Frequency  (GMF)  algorithm  achieves  better  power  efficiency. 

•  removes  algorithm  constraint 

•  reduces  impact  of  processor  constraints 


unused  idle 


capacity 


Problem  Description 


Given: 

•  multiprocessor  platform  supporting  independent  VFS,  and 

•  a  set  of  periodic  tasks  with  implicit  deadlines 

Compute: 

•  frequency  assignment  that  minimizes  power  consumption  while  meeting 
tasks’  deadlines 
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Processor  Power-Frequency  Relationship 


Dynamic  power  in  processors  is  proportional  to  the  product  of  the 
processor  frequency  and  the  square  of  the  supply  voltage. 

P  cc  V2  f 


In  VFS,  power  can  be  reduced  by 
reducing  the  frequency,  which 
allows  a  corresponding  reduction 
in  the  voltage. 


Since  voltage  is  proportional  to  the 
frequency  we  can  approximate  as 


P*f 
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Task  and  Platform  Model 


Tasks 

n  :  number  of  tasks 

Q  :  execution  time  of  task  measured  at  the  highest  frequency 
Tt  :  period  of  task  t, 

Dt  =  To  implicit  deadlines 
ut  =  Ci/T. :  utilization  of  task  r ,■ 

U  =  :  total  utilization 

Platform: 

m  :  number  of  processors  (all  identical) 

fi  :  normalized  frequency  (1  =  highest  frequency)  for  processor  / 
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Uniform  Frequency  Scaling 


•  All  processors  assigned  the  same  frequency 

•  Tasks  scheduled  with  an  optimal  global  scheduler  (e.g.,  LNREF) 


/  =  max(t//m,u1,  ...,un) 


Non-Uniform  Frequency  Scaling  i 


•  Processor  frequencies  are  assigned  independently 


Decide  Independent  Frequency  algorithm  avoids  heavy  task  bottleneck 
[Funaoka  2008] 

•  Task  t i  is  heavy  if  its  utilization  would  drive  up  the  uniform  frequency 


assignment  for  the  remaining  processors,  i.e.  u;  > 


S"=i 

m-i+l 


•  Each  heavy  task  is  assigned  its  own  processor 

•  Remaining  light  tasks  globally  scheduled  in  remaining  processors  with 
uniform  frequency  assignment 
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Non-Uniform  Frequency  Scaling  2 


Decide  Independent  Frequency  is  optimal  if  frequency  can  be  scaled 
continuously  (i.e.  to  any  frequency  in  a  range) 


Freq 


Freq 


uniform  frequency 
scaling  for  remaining 
tasks  and  processors 
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Discrete  Frequency  Steps 


However,  processors  support  a  limited  number  of  frequencies. 

•  DIF  is  not  optimal  in  that  setting 

•  Computing  the  optimal  partition  of  processors  and  the  frequency 
assignment  is  NP-Hard 


Example:  supported 
frequencies:  1,  .75,  .5 


■13%  @  .75^ 


Idle  capacity 
isolated  by 
partition 
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Achieving  Better  Power  Efficiency 


Two  problems 

•  Discrete  frequency  steps  force  leaving  idle  capacity  in  processor  partitions 

•  Unused  capacity  in  a  processor  partition  cannot  be  used  by  tasks 
assigned  to  other  partitions 

Observation:  if  we  can  optimally  schedule  tasks  allowing  them  to 
migrate  between  processors  running  at  different  frequencies  we  can  do 
better 

•  avoid  the  set  partition  problem  (and  its  computational  complexity) 

•  achieve  better  power  efficiency 

-  no  fragmentation  of  platform  capacity 

-  capacity  left  by  heavy  tasks  is  not  wasted 
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U-LLREF 


U-LLREF  [Funk  2010]  is  an  optimal  global  scheduling  algorithm  for 
uniform  multiprocessors 

•  an  extension  of  LLREF  (a  DP-fair  algorithm) 

•  processors  can  run  at  different  frequencies 

A  task  set  is  schedulable  by  U-LLREF  on  a  platform  if  the  following 
holds 

k  k 

V/c  e  {1,  ...,m  —  1} 

i= 1  i= 1 

n  m 

^  ^  '  fi 
i= 1  i= 1 

where  u±  >  •••  >  un  and  f\  >  •••  >  fm 
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Growing  Minimum  Frequency  Algorithm 


Overview:  satisfy  each  condition  of  the  U-LLREF  test  using  the  most 
power  efficient  assignment  of  frequencies  (lowest  possible  and 
distributed  as  uniformly  as  possible) 

assign  the  lowest  frequency  to  all  the  processors 

for  k  =  1  to  m  do 

while  kth  U-LLREF  condition  not  satisfied  do 

increase  the  frequency  of  the  slowest 
processors  in  subset  l..k  to  the  next 
frequency  step 

end  while 

end  for 
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GMF  Example  i 


k=l 


slowest 

O 


-  Software  Engineering  Institute 


ut  =1.0  > 
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GMF  Example  2 


k=l 


slowest 

O 


-  Software  Engineering  Institute 


ut  =1.0  > 
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GMF  Example  3 


k=l 


slowest 

O 


-  Software  Engineering  Institute 


1/,  =1.0  = 
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GMF  Example  4 


k  =  2 


slowest 
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GMF  Example  5 


k  =  2 


slowest 
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GMF  Example  6 


k  =  2 


slowest 

O 


-  Software  Engineering  Institute 


k 


i=l 


if,  =2.0  + 
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GMF  Example  6 


k  =  3 


slowest 
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GMF  Example  7 


k  =  4  =  m 


slowest 

O 


-  Software  Engineering  Institute 


&-3-1 

1=1 


m 


>  Zy;=3.o 


i= 1 
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GMF  Example  8 


k  =  4  =  m 


slowest 

O 


-  Software  Engineering  Institute 


i>,= 3.1  < 

i=l 
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GMF  Example  9 


In  this  case,  frequency  assignment  is  the  same  as  in  the  Exhaustive 
partition  search. 


Ty  t2  t3  t4  t5 
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Evaluation 


Randomly  generated  15,000  tasksets 

•  utilization  level  ranging  from  0.5  to  4  in  steps  of  0.25 

•  1,000  tasksets  for  each  utilization  level 

•  each  taskset  composed  of  tasks  with  random  uniform  utilization 

Used  frequencies  and  voltages  of  three  different  quad-core  processors 

Computed  frequency  assignment  and  corresponding  power  with 
different  multiprocessor  VFS  algorithms 

•  Decide  Independent  Frequency  [Funaoka  2008] 

•  Exhaustive  partition/frequency  assignment  search 

•  GMF 

•  Optimal  (exhaustive  frequency  assignment  w/o  partitions) 
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Optimality  i 


GMF  is  optimal  when  the  supported  frequency  steps  are  uniform. 


Proof  intuition: 

•  Any  frequency  step  we  choose  is 
the  same  in  terms  of  speed 

•  Increasing  the  frequency  of  the 
slowest  processor  requires  the 
smallest  power  increase 

•  The  optimal  frequency  assignment 
for  the  first  /  conditions  bounds  from 
below  the  optimal  assignment  for 
the !+]  conditions 

•  GMF  assigns  frequencies  as  even 
as  possible  within  that  bound 
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Optimality  2 


With  non-uniform  frequency  steps,  GMF  may  not  optimal 


We  have  observed  that  for  some  platforms  with  non-uniform  frequency 
steps  GMF  is  still  optimal 

•  When  the  power  steps  associated  frequency  steps  are  non-decreasing 
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Conclusion 


Growing  Minimum  Frequency  (GMF)  algorithm  computes  the  optimal 
frequency  assignment  to  minimize  the  power  consumption  of  a  real-time 
periodic  taskset  in  a  multiprocessor  platform. 


Evaluation  results  show  up  to  30%  improvement  over  previous 
algorithms. 


Avoiding  partitioning  allows  GMF  to  achieve  better  power  efficiency  than 
optimal  partitioned  approaches. 
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